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Abstract: We obtain rates of convergence in limit theorems of partial sums Sn for certain 
sequences of dependent, identically distributed random variables, which arise naturally in sta- 
tistical mechanics, in particular, in the context of the Curie- Weiss models. Under appropriate 
assumptions there exists a real number a, a positive real number /i, and a positive integer k such 
that {Sn — na)/n^~^/^'^ converges weakly to a random variable with density proportional to 
exp(— /i|a;p'^/(2fc)!). We develop Stein's method for exchangeable pairs for a rich class of distri- 
butional approximations including the Gaussian distributions as well as the non-Gaussian limit 
distributions with density proportional to exp(— ^|xp'^/(2fc)!). Our results include the optimal 
Berry-Esseen rate in the Central Limit Theorem for the total magnetization in the classical 
Curie- Weiss model, for high temperatures as well as at the critical temperature f3c = I, where 
the Central Limit Theorem fails. Moreover, we analyze Berry-Esseen bounds as the temper- 
ature l//3n converges to one and obtain a threshold for the speed of this convergence. Single 
spin distributions satisfying the Griffiths- Hurst-Sherman (GHS) inequality like models of liquid 
helium or continuous Curie- Weiss models are considered. 
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1. Introduction and main result 

There is a long tradition in considering mean-field models in statistical mechanics. The Curie- Weiss 
model is famous, since it exhibits a number of properties of real substances, such as multiple phases, 
metastable states and others, explicitly. The aim of this paper is to prove Berry-Esseen bounds for the 
sums of dependent random variables occurring in statistical mechanics under the name Curie- Weiss 
models. To this end, we will develop Stein's method for exchangeable pairs (see |22j ) for a rich class 
of distributional approximations. For an overview of results on the Curie-Weiss models and related 
models, see [TO], [E], [H]. 

For a fixed positive integer d and a finite subset A of Z'^, a ferromagnetic crystal is described by 
random variables which represent the spins of the atom at sites z G A, where A describes the 
macroscopic shape of the crystal. In Curie-Weiss models, the joint distribution at fixed temperature 
T > of the spin random variables is given by 

PA,f3i{xi)) := PA,/3((X,^)ieA = (xi)ieA) := ^^)^) 11 M^t)- (1-1) 

Here f3 := T^^ is the inverse temperature and Z\(f3) is a normalizing constant known as the partition 
function and |A| denotes the cardinality of A. Moreover g is the distribution of a single spin in the 
limit P ^ 0. We define 5a = ^if^A^j^, the total magnetization inside A. We take without loss of 
generality d = I and A = {1, . . . ,n}, where n is a positive integer. We write n, X^"\ Pnfi and Sn, 
respectively, instead of |A|, X^, PA,f3, and S^, respectively. In the case where (3 is fixed we may even 
sometimes simply write Pn- 

We assume that g is in the class B of non-degenerate symmetric Borel probability measures on M 
which satisfy 

exp ] dg{x) < oo for all 6 > 0. (1.2) 



2 

In the classical Curie-Weiss model, spins are distributed in {— 1,+1} according to g = ^{S-i +Si). 
More generally, the Curie- Weiss model carries an additional parameter h > called external magnetic 
field which leads to the modified measure, given by 

PnM^) = ^M^Sl + l3hSn) dg^^ix), X = {Xi). 

Zn,f3,h 2n 

The measures Pn,p,h is completely determined by the value of the total magnetization. It is therefore 
called an order parameter and its behaviour will be studied in this paper. The non-negative external 
magnetic field strength may even depend on the site: 

1 /3 " 

Pn,l3M,-,hn{x) = exp( — 5^ + /? /ijXj) d£l'^"(x), X={Xi). (1.3) 

^n,/3,/il,...,/i„ ^ 

In the general case (II. ip . we will see (analogously to the treatment in [12^ I14j) that the asymptotic 
behaviour of Sn depends crucially on the extremal points of a function G (which is a transform of the 
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rate function in a corresponding large deviation principle): define 

(pg{s) := log J exp(s x) (i£)(a;) 



G,i(3,s):=^-<P,iPs). (1.4) 



and 

2 

We shall drop /3 in the notation for G whenever there is no danger of confusion, similarly we will 
suppress g in the notation for (p and G. For any measure g £ B, G was proved to have global minima, 
which can be only finite in number, see \12\ Lemma 3.1]. Define C = C^, to be the discrete, non-empty 
set of minima (local or global) of G. If a E C, then there exists a positive integer k := k{a) and a 
positive real number fi := ^{a) such that 

G{s) = G{a) + ^(")(^ ~ ")^^ ^ o{{s - af^+^) as s ^ a. (1.5) 
(2fe)! 

The numbers k and ^ are called the type and strength, respectively, of the extremal point a. Moreover, 
we define the maximal type k* of G by the formula 

k* = max{A;(a); a is a global minimum of G}. 

Note that the ju(a) can be calculated explicitly: one gets 

/i(a) = /3-/3V'(/3a) if A; = 1 (1.6) 

while 

^{a) = -/^^'^(/^(^fc) (/J a) if A: > 2 (1.7) 

(see [H]). 

An interesting point is, that the global minima of G of maximal type correspond to stable states, 
meaning that multiple minima represent a mixed phase and a unique global minimum a pure phase. 
For details see the discussions in [E 



The following is known about the fluctuation behaviour of Sn under P„. In the classical model {g 
is the symmetric Bernoulli measure), for < /3 < 1, in [T^] the Central Limit Theorem is proved: 

S^^Ar(0,a2(/3)) 

in distribution with respect to the Curie- Weiss flnite volume Gibbs states with cr^(/3) = (1 — (3)~^. 
Since for (3 = 1 the variance cr^(/3) diverges, the Central Limit Theorem fails at the critical point. In 
[12j it is proved that for (3 = 1 there exists a random variable X with probability density proportional 
to exp(— ^x^) such that as n — > oo 

Y(ji=l ^ 



in distribution with respect to the finite-volume Gibbs states. Asymptotic independence properties 
and propagation of chaos for blocks of size o(n) have been investigated in [2]. 
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In general, given g £ B, let a be one of the global minima of maximal type k and strength fi of Gg. 
Then 

Sn - na 
ni-i/2k ~^ 

in distribution, where X^^^^p is a random variable with probability density fk,^,i3, defined by 

= exp(-xV2cT2) (1.8) 

and for k >2 

exp(-/xx2V(2fc)!) 
- jexp(-^x2V(2A:)!) dx' ^ ' 

Here, cj^ = i - i so that for = ii{a) as in ([L6]), cj^ = ([0"(/?a)]"^ - /3)"^ (see [H], [H]). Moderate 
deviation principles have been investigated in [7]. 

In [TT] and [H], a class of measures g is described exhibiting a behaviour similar to that of the 
classical Curie-Weiss model. Assume that g is any symmetric measure that satisfies the Griffiths- 
Hurst-Sherman (GHS) inequality, 

^(/)g(s) < for all s > 0, (1.10) 

(see also |131ll6j). One can show that in this case G has the following properties: There exists a value 
/3c, the inverse critical temperature, and G has a unique global minimum at the origin for < /3 < /?c 
and exactly two global minima, of equal type, for (3 > /3c- For (3c the unique global minimum is of type 
k > 2 whereas for /3 G (0, (3c) the unique global minimum is of type 1. At /3c the law of large numbers 
still holds, but the fluctuations of Sn live on a smaller scale than ^/n. This critical temperature can 
be explicitly computed as (3c = 1/0" (0) = l/Varg(Xi). By rescaling the Xi we may thus assume that 
/3c = 1. 

Alternatively, the GHS-inequality can be formulated in the terms of Zn^/3^hi,...,h„j defined in (|1.3p : 

= E{XiXjXk) - nX^mXjXk) - nXjMX^Xk) (1.11) 
-¥.{XkW{XiXj) + 2E(X,)E(Xj)E(Xfc) 

for all (not necessarily distinct) sites k G {1, . . . ,n}. Here E denotes the expectation with respect 
to Pn,i3,hi,---,h„- The GHS inequality has a number of interesting implications, see [TT] . 

With GHS, we will denote the set of measures g £ B such that the GHS-inequality p.lU|) is valid 
(for Pn,p,hi,...,hn the sense of (|1.11|) ). We will give examples in Section 7. 

Remark 1.1. In [12\ Lemma 4.1], for ^ G i3 it is proved that G has a unique global minimum if and 
only if 

J exp{s x) dg{x) < exp(s^/2), for s real. 
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where the right hand side of this strict inequahty is the moment generating function of a standard 
normal random variable. Moreover, in the same Lemma it is proved that G has a local minimum at 
the origin of type k and strength ^ if and only if 



Here ^ij{Q) and /ij define the j'th moment of q and the j'th moment of a standard normal random 
variables, respectively. Note that this in particular implies = Ep(Xi) = 0. 

The aim of this paper is to prove the following theorems: 
1.1 Results for the classical Curie- Weiss model 

Theorem 1.2 (classical Curie- Weiss model, optimal Berry-Esseen bounds outside the critical tem- 
perature). Let Q = + |(5i and < /? < 1. We have 



where denotes the distribution function of the normal distribution with expectation zero and vari- 
ance (1 — /3)~^, and C is an absolute constant, depending on (3, only. 

Theorem 1.3 (classical Curie- Weiss model, optimal Berry-Esseen bounds at the critical temperature). 

Let Q = + ^Si and (3=1. We have 




for j = 0,l,...,2A;-l 
for j = 2k. 




(1.12) 




(1.13) 



where 




(1.14) 



Z := exp(— a;^/12) dx and C is an absolute constant. 



Theorem 1.4 (Berry-Esseen bounds for size-dependent temperatures). Let g = 2^-1 + '^^'^ 
< /3n < 00 depend on n in such a way that /?„ ^ 1 monotonically as n ^ 00. Then the following 
assertions hold: 



(1) Ifdn-l 



for some 7 7^ 0, we have 




(1.15) 



with 




where Z := /iRexp(— ^2 + dx and C is an absolute constant. 



(2) If \l3n — 1| ^ n ^^"^ , Sn/n^^^ converges in distribution to F, given in (11. 14^ . Moreover, \ 



- 1| = 0{n~^), (prT3]) holds true. 
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with an absolute constant C. 

Remark 1.5. In [1], Barbour obtained distributional limit theorems, together with rates of conver- 
gence, for the equilibrium distributions of a variety of one-dimensional Markov population processes. 
In section 3 he mentioned, that his results can be interpreted in the framework of [12j. As far as we 



Remark 1.6. In the first assertion of Theorem 11.41 our method of proof allows to compare the 
distribution of Sn/n^^* alternatively with the distribution with Lebesgue-density proportional to 




To be able to compare the distribution of interest with a distribution depending on n (on /?„), is one 
of the advantages of Stein's method. The proof of this statement follows immediately from the proof 
of Theorem II. 4i 

If in Theorem ll.4l (2) — 1| » n^^ the speed of convergence reduces to 0{^/n\l — /9n|)- Likewise, 
if in Theorem O (3) - 1| < n"^/^, the speed of convergence is 0{ ^^^\p j ). This reduced speed of 
convergence reflects the influence of two potential limiting measures. Next to the "true" limit there 
is also the limit measure from part (1) of Theorem II. 4^ which in these cases is relatively close to our 
measures of interest. 

1.2 Results for a general class of Curie- Weiss models 

More generally, we obtain Berry-Esseen bounds for sums of dependent random variables occurring 
in the general Curie- Weiss models. We will be able to obtain Berry-Esseen-type results for ^3-a.s. 
bounded single-spin variables Xi: 

Theorem 1.7. Given g £ B in GHS, let a he the global minimum of type k and strength fj, of Gg. 
Assume that the single-spin random variables Xi are bounded g-a.s. In the case k = 1 we obtain 



where W := Sn/\/n and <I>iy denotes the distribution function of the normal distribution with mean 
zero and variance E(H^^) and C is an absolute constant depending on < /3 < 1. For k > 2 we obtain 



understand, his result (3.9) can be interpreted as the statement (11.13^ . but with the rate n 



1/4 




(1.16) 




(1.17) 
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where Fw,k{z) := Jl^ fw,k{x) dx with fw,k defined by 



) 



/exp(- 



) dx 



with W :- 



^1-1/ 2k ^''^d Ck is an absolute constant. 



Theorem 1.8. Let g £ B satisfy the GHS-inequality and assume that /3c = 1. Let a be the global 
minimum of type k with k > 2 and strength fik of Gg and let the single-spin variable Xi be bounded. 
Let < (3n < oo depend on n in such a way that — > 1 monotonically as n ^ oo. Then the following 
assertions hold true: 



(1) If I3n — 1 = — [3T for some 7 / 0, we have 



sup 

zeM 



Sn 



na 



ni-i/2k 



< z 



Fw,k,-tiz) 



< CkU 



-l/k 



(1.18) 



with 



Fw,k,^{z) := ^ j exp^-c^^ 



V(2A^)! 

where Z := /]r exp(— (t^^x^'^ — -^x^)) dx, with W :- 



cw 



7 2 



,1-1/2*; 7 



dx. 



l^k 
{2k)\ 



and Ck is an absolute constant. 

(2) If \f3n — 1| ^ n^^^"^/^\ ^i-ij2k converges in distribution to Fyy ^, defined as in Theorem \1.7[ 
Moreover, i/ |/?„ - 1| =0{n~^), p^T]) holds true. 

(3) If\f3n - 1| > n-(i-i/^), the Kolmogorov distance of the distribution ofW := \/^^Y17=i^i 
and the normal distribution N{0,¥,{W'^)) converges to zero. Moreover, if \f3n — 1| ^ 
n^^-'^/^^-'^/^'^), we obtain 



sup 



< Z 



n 



with an absolute constant C . 



Remark 1.9. Since the symmetric Bernoulli law is GHS, Theorems ll.7l and ll.8l include Berry-Esseen 
type results for this case. But these results differ from the results in Theorem II. 2( 11.31 and 11.41 with 
respect to the limiting laws: the laws in ll.7l and ll.8l depend on moments of W . The bounds in Theorems 
ll.2til.4l are easier to obtain; moreover their proofs apply Corollary 12.81 and part (2) of Theorem 14.61 
which are less involved versions of Stein's method for exchangeable pairs. 



For arbitrary g £ GHS we are able to proof good bounds with respect to the Wasserstein-metric. 
For any class of test functions TC, a distance on probability measures on R can be defined by 



dn{P,Q) = sup 

hen 



hdP 



hdQ 
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The class of test functions h for the Wasserstein distance is just the Lipschitz functions Lip(l) with 
constant no greater than 1. The total variation distance is given by the set 7i of indicators of Borel 
sets, the Kolmogorov distance dx by the set of indicators of half lines. 

Only for technical reasons, we consider now a modified model. Let 

Pn,f3,hi^) = ^ expf- ^ XiXj + ph'^Xij dg'^"'{x), X = (xi) 

^ l<i<7<n i=l ' 



Zn,l3,h V" l<i<j<n 

Theorem 1.10. Given the Curie-Weiss model Pn,p CLnd g €z B in GHS, let a he the global minimum 
of type k and strength fi of Gg. In the case k = 1, for any uniformly Lipschitz function h we obtain 
for W = Sn/y/n that 

\E{h{W)) -<Pw{h)\<\\h'\\C' ^ ' ' ' ^' ^ 



n 



Here C is a constant depending on < (3 < 1 and ■= J^h{z)^\Y{dz). The random variable 

X'- is drawn from the conditional distribution of the i'th coordinate Xi given {Xj)j^i (this choice will 
be explained in Section 3). For k > 2 we obtain for any uniformly Lipschitz function h and for 



XT/ Sn-na 

• „l-l/2k 



I / . xN - . M n nJ 1 C2 max E|XiP,E|Xi'P)\ 
Here Ci,C2 are constants, and Fv/,k{h) '■= f^h{z)Fw^k{dz). 

Remark 1.11. Assume that there exists a 5 such that for any uniformly Lipschitz function h, 
\Kh(W) — F{h)\ < 5\\h'\\, where W is a random variable, F{h) := J^h{z)F{dz) for some distri- 
bution function F, then from the definition of the Wasserstein distance it follows immediately that 
sup/igLip(i) |IE/i(VF) — F{h)\ < 5. Moreover, the Kolmogorov distance sup^ < z) — F{z)\ can 

be bounded by cpS^^'^, where Ci? is some constant depending on F (the proof follows the lines of [6l 
Theorem 3.1]). 



Remark 1.12. In [TT], the distribution of the spins g are allowed to depend on the site. They define 
a subclass Q oi B such that for . . . , £ Q the GHS inequality holds. In Section 7 we present 
a large class of measures which belong to Q (see |1H Theorem 1.2]). The GHS inequality itself has 
a number of interesting implications like the concavity of the average magnetization as a function 
of the external field h or the monotonicity of correlation length in Ising models. These and other 
implications can be found in |llj and references therein. Note that for g G GHS, 0g(s) < for 
all real s, where = j^x"^ g{dx). These measures are called sub-Gaussian. Very important for our 
proofs of Berry-Esseen bounds will be the following correlation-inequality due to Lebowitz [18]: If 
E denotes the expectation with respect to the measure Pn,f3,hi,...,h„, one observes easily that for any 
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g E B and sites i,j,k,l £ {1, . . . , n} the following identity holds: 

^3 



-E{Xi) 



(1.19) 

all /ij=0 



dhi dhj dhk 

= ¥.{X,X,XkXi) - ¥.{X,Xj)¥.{XkXi) - E(X,Xfc)E(X,XO - E(XiXz)E(X,-Xfc). 

Lebowitz [18] proved that if ^ G GHS, then (fLTQl) is non-positive (see [101 V.13.7.(b)] and [17]). 
Stein's method reduces to the computation of, or bounds on, low order moments, perhaps even only 
on variances of certain quantities. Such variance computations can be very difficult. We will see in 
the proof of Theorem 1 1 . 71 and Theorem 11.81 the use of Lebowitz' inequality for bounding the variances 
successfully. 

In the situation of Theorem 11.71 and Theorem 11.81 we can bound higher order moments as follows: 

Lemma 1.13. Given g £ B, let a be one of the global minima of maximal type k for k > 1 and 
strength n of Gg. For 

• „l-l/2fc 

we obtain for any I G N 

E|W^|' < const. (0- 

We prepare for the proof of Lemma 11.131 It considers a well known transformation - sometimes 
called the Hubbard-Stratonovich transformation - of our measure of interest. 

Lemma 1.14. Let m G M and < 7 < 1 6e real numbers. Gonsider the measure Qn.0 '■= {Pn ° 
i^^y where MiO, 

BrJi-i ) denotes a Gaussian random variable with mean 
and variance p^l-i-i ■ Then for all n > 1 the measure Qn,/3 is absolutely continuous with density 

exp (-nG(^^ + m)) 



zero 



/jjexp (-nG(p^ + m)) ds' 

where G is defined in equation (jl.4p . 



(1.20) 



Remark 1.15. As shown in [12j, Lemma 3.1, our condition (jl.2p ensures that 

exp(-nG(-^ + m))ds 
is finite, such that the above density is well defined. 

Proof of Lemma \1.14\ The proof of this lemma can be found at many places, e.g. in [12], Lemma 
3.3. □ 

Proof of Lemma \1.13[ We apply the Hubbard-Stratonovich transformation with 7 = 1 — 1/2A;. It 
is clear that this does not change the finiteness of any of the moments of W. Using the Taylor 
expansion (11. 5p of G, we see that the density of Qn,i3 with respect to Lebesgue measure is given by 
Const. exp(—x^'^) (up to negligible terms, see e.g. [12], [7]). A measure with this density, of course, 
has moments of any finite order. □ 
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Remark 1.16. As we will see, we only have to bound E(T4^^) in the classical model, when < /3 < 
1. This can be obtained directly using the definition of P„ and Taylor-expansion. But already for 
the classical model, for /3 = 1, it is quite cumbersome to bound higher order moments via direct 
calculations. 

In Section 2, we develop in Theorem 12.51 Corollary 12.81 and Corollary 12.91 refinements of Stein's 
method for exchangeable pairs in the case of normal approximation. As a first application we prove 
Theorem [L2] in Section 3. In Section 4 we develop Stein's method for exchangeable pairs for a rich class 
of other distributional approximations. Obtaining good bounds for the solutions of the corresponding 
Stein equations in the appendix, we prove Theorem 11.31 and Theorem 11.41 in Section 5, applying 
Theorem 14.61 In Section 6, we proof Theorems ll. 7111-81 and ll.lOl applying Corollary 12.91 and Theorem 
14.71 Section 7 contains a collection of examples including the Curie- Weiss model with three states, 
studying liquid helium, and a continuous Curie- Weiss model, where the single spin distribution ^ is a 
uniform distribution. 

2. Stein's method with exchangeable pairs for normal approximation 

Stein introduced in [22] the exchangeable pair approach. Given a random variable W, Stein's method 
is based on the construction of another variable W (some coupling) such that the pair (W, W) is 
exchangeable, i.e. their joint distribution is symmetric. The approach essentially uses the elementary 
fact that if (VF,H^') is an exchangeable pair, then 'Eg{W,W') = for all antisymmetric measurable 
functions g{x,y) such that the expectation exists. A theorem of Stein ([22], Theorem 1, Lecture III]) 
shows that a measure of proximity of W to normality may be provided in terms of the exchangeable 
pair, requiring W' — to be sufficiently small. He assumed the linear regression property 

E(M/'|H^) = {l-X)W 

for some < A < 1. This approach has been successfully applied in many models, see [22j and for 
example |23j and references therein. In [19], the range of application was extended by replacing the 
linear regression property by a weaker condition, allowing to hold the regression property only approx- 
imately. The exchangeable pair approach is also successful for other distributional approximations, as 
will be shown in Section 4. We develop Stein's method by replacing the linear regression property by 

E(Ty'|VF) = W + A V(VF) + R{W), 

where ^p{x) will be depend on a continuous distribution under consideration. Before we consider in this 
section the case of normal approximation, we mention that this is not the first paper to study other 
distributional approximations via Stein's method. For a rather large class of continuous distributions, 
the Stein characterization was introduced in [23], following [22\ Chapter 6]. In [23], the method of 
exchangeable pairs was introduced for this class of distribution and used in a simulation context. 
Recently, the exchangeable pair approach was introduced for exponential approximation in [4| Lemma 
2.1]. 
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For measuring the distance of the distribution of W and the standard normal distribution (or any 
other distribution), we would like to bound 

\m{w) - $(/i)| 

for a class of test functions h £ Ti., where ^{h) := h{z)^{dz) and $ is the standard normal 
distribution function. One advantage of Stein's method is that we are able to obtain bounds for 
different distances like the Wasserstein distance d^, the total variation distance dxv or the Kolmogorov 
distance dx- In [TM, the exchangeable pair approach of Stein was developed for a broad class of non 
smooth functions h, applying standard smoothing inequalities. 

They proved the following: 

Theorem 2.1 (Rinott, Rotar: 1997). Consider a random variable W withE{W) = andE{W'^) = 1. 
Let (W,W') be an exchangeable pair (i.e., their joint distribution is symmetric). Define a random 
variable R = R{W) by 

E{W'\W) = {l-X)W + R, 
where X is a number satisfying < A < 1 . // moreover 

\W' -W\ <A 

for a constant A. Then one obtains 



snY>\P{W < z)-<^{z)\ < — Vvar{E[(W^' - Wf\W]} + 'i7^^ — + AS^/lJ^— + ^/ij^^. (2.1) 
zeR A A A vA 

Remark 2.2. Rinott and Rotar also proved a bound in the case, where \W' — W\ is not assumed to 
be bounded. In this case, the last two summands on the right hand side of ()2.ip have to be replaced 

by 



-W\^. 

A 



This estimation is crude, since even for a normalized sum of n independent variables W , it leads to a 
bound of the order n"^/^. The advantage of the results in [TO] is, that these bounds do not only apply 
to indicators on half lines, but also to a broad class of non smooth test functions, see |19j Section 1.2]. 

Chen and Shao introduced a concentration inequality approach. Here a concentration inequality is 
proved using the Stein identity (see [5] and [6] ) . In the context of the construction of an exchangeable 
pair, in [20] Shao and Su proved the following theorem: 

Theorem 2.3 (Shao, Su: 2005). Let W be a random variable with 'E{W) = and E(VF^) < 1 and 
{W, W') be an exchangeable pair such that 

E{W'\W) = (1 - \)W 
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with < A < 1, then for any a > 



sup \P{W < z)-Hz)\ < 1 - ^E{{W - WyiW)) +^:^+l.5a+^E{{W-W')l{\w-W'\>a})- 



2A " 7 A 2A 

(2.2 

If \W — W'\ < A, then the bound reduces to 



/ / 1 \^ 41^3 

sup \P{W < z) -^{z)\ < \ Ell- —E{{W -W'y\W)] +— + 1.5^. (2.3) 

zm V V 2A J X 

Remark 2.4. When \W — W'\ is bounded, (|2.3p improves (j2.ip with respect to the constants. 

Following the lines of the proofs in [19j and [20j, we obtain the following refinement: Given two 
random variables X and Y defined on a common probability space, we denote by 

dKiX,Y) := sup\P{X < z) - P{Y < z)\ 
the Kolmogorov distance of the distributions of X and Y. 

Theorem 2.5. Let {W,W') be an exchangeable pair of real-valued random variables such that 

E{W'\W) = (1 - X)W + R 

for some random variable R = R{W) and with < A < 1. Assume that E(VF^) < 1. Let Z be a 
random variable with standard normal distribution. Then for any A> {), 



dK{W,Z) < JE[l-^E[{W' -WY\W]\ + f 4^ + l.5A^ ^^^^ 



A 2A 

If \W — W'\ < A for a constant A, we obtain the bound 



2X ' ' 'J \ 4 J X 

41A'^ 1 
+ — r— + 1.5/1 + :r^E((W - W')%,v.w.,>A})- 



4<(W, Z) < ^e(i - ±mv - wnvi) + + 1.5^) ^ + ^ + 1.5A (2.4) 

Remark 2.6. When \W — W'\ is bounded, (j2.4p improves ()2.ip with respect to the Berry-Esseen 
constants. 

Proof. We sketch the proof: For a function / with |/(x)| < C(l + |x|) we obtain 

= E{{W-W'){f{W') + f{W))) 

= E{{W-W'){f{W')-f{W)))+2XE{Wf{W))-2E{f{W)R). (2.5) 

Let f = fz denote the solution of the Stein equation 

fUx) - xf,{x) = l|,<4(x) - ^{z). (2.6) 
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We obtain 

P{W <z)- ^{z) = E{f'{W) - Wf{W)) 

= EifiW)) - - W'){fiW) - fiW'))) - jE{f{W) R) 

= E(^f'{W){l-j^{W-W'f)^ +E(^f'iW)j^iW-W'f^ 

-^E{{W - W'){f{W) - f{W'))) - jE{f{W) R) 

= E(^f\W){l - ^{W -W'f)^ - ^E{2f{W)R) 

(w - w'){fiw) - fiW) -{w- w')nw)) 

=: T1 + T2 + T3. (2.7) 
Using < 1 for all real x (see [6l Lemma 2.2]), we obtain the bound 



2A 



iTil < \lE{l-j^E[{W' -W)W] 



Using < f{x) < \/27r/4 (see [U Lemma 2.2]), we have 

1^21 < ^E{\R\) < ^vnm- 

Bounding T3 we apply the concentration technique, see [20j : 



(-2A)r3 = e({W-W')^w-W'\>a} ^ {f'{W + t)-f'iW))dt 

V j-(w-w') J 

+ E((^-iy')l{|iy-iy'|<A} r {r[W ^t)-f\W))dl\. (2.8) 

The modulus of the first term can be bounded by E((VF — W^')^1{|W^-W^'|>A}) using \ j'[x) — f'{y)\ < 1 
for all real x and y (see [6], Lemma 2.2]). Using the Stein identity (j2.6p . the second summand can be 
represented as 



e({W - W')l{\w-W'\<A} f {{W + t)f{W + t)- Wf{W))dt 



-(W-W) 

Next observe that \Ux\< 0.82^^, see PO]: by the mean value theorem one gets 

(VF + t)/(VF + t)-VF/(W^) = VF(/(PF + t)-/(W^)) + t/(VF + t) = W{1 /'(VF + ut)tdn) +t/(W^ + t). 
Hence 

+ t)^{yv + 1) - w^{y/)\ < \w\ \t\ + \t\V2^/A = \t\{V2^/4 + \w\). 



14 



PETER EICHELSBACHER AND MATTHIAS LOWE 



Using K\W\ < ^yWW^ < 1 gives the bound. The term U2 can be bounded by 

E[{W - Vf^')^-^{0<(VK-VK')<^} ^{z<W<z+A}) ■ 



Under the assumptions of our Theorem we proceed as in [20] and obtain the following concentration 
inequality: 

E{{W - W')^I{o<iw-W')<a} hz<w<z+A}) < 3^(A + E{R)). (2.9) 
To see this, we apply the estimate 

E{{W - W'flo<^w-W')<A h<w<z+A) < mw - W'){f{W) - f{W'))), 

see |20j : here / is defined by f{x) := —1.5A for x < z — A, f{x) := 1.5A for x > z + 2A and 
f{x) := X — z — A/2 in between. Now we apply ()2.5p and get 

E{{W - W'fl{o<iw-W')<A} l{z<w<z+A}) < 2XE{Wf{W)) + 2K{f{W)R) < 3A{X + E{\R\)), 



where we used E(|M^|) < y^KiW"^) < 1. Similarly, we obtain 

U2 > -3A{X + E{R)). 

□ 

Remark 2.7. In Theorem 12.51 we assumed E(VF^) < 1. Alternatively, let us assume that E(l^^) is 
finite. Then the proof of Theorem 12.51 shows, that the third and the fourth summand of the bound 
()2.4p change to 



In the following corollary, we discuss the Kolmogorov-distance of the distribution of a random 
variable to a random variable distributed according to A^(0, cj^), the normal distribution with 
mean zero and variance o"^. 

Corollary 2.8. Let > and {W, W) be an exchangeable pair of real-valued random variables such 
that 

E(W'\W) = (1- + R (2.10) 

for some random variable R = R{W) and with < A < 1. Assume that E(M^^) is finite. Let Z^, be a 
random variable distributed according to N{0,a'^). If \W — W'\ < A for a constant A, we obtain the 
bound 



dK{W,Z^) < \IE(1-\e[{W' -W)''\W]] + + 1.5A^ VHR^) 



2X ' ' 'J y A y A 

. ^(^.^) + i.5.« (2.n) 
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KA^) - —UA^) = hx<z}{x) - F^{z) (2.12) 



Proof. Let us denote by /o- := fa,z the solution of the Stein equation 

X 

with Fa-{z) := -y^^ /foo ^^P(~^) ^^^y ^° ^^^^ identity fa,z{x) = (^fz{^), where fz 

is the solution of the corresponding Stein equation of the standard normal distribution, holds true. 
Using [61 Lemma 2.2] we obtain < f„{x) < a^, \ fUx)\ < 1, and \ f'„{x) - f^{y)\ < 1. With (l2lll 
we arrive at 

P{W <z)- F,{z) = Ti+T2 + T3 

with Tj's defined in (|2.7|) . Using the bounds of /g- and the bound of Ti is the same as in the proof 
of Theorem 12.51 whereas the bound of T2 changes to 



\T2\<a^Anm- 

Since we consider the case \ W — W'\ < A, we have to bound 

= -1^e({W -W')l{\w-W'\<A} r {nW + t)-f'{W))dt). 

\ J_(Vl/-VK') / 

Using the Stein identity (12.12p . the mean value theorem as well as the concentration inequality- 
argument along the lines of the proof of Theorem 12.51 we obtain 

Hence the corollary is proved. □ 
With (I2T0I1 we obtain E{W - W'f = '^E{W'^) - 2E(T^i?). Therefore 

Efl-i-E[(M^'-M.)Wl)= 1-^ + 50^. (2.13) 

\ 2A ) a'^ A 

so that the bound in Corollary 12.81 is only useful when E(l^^) is close to (and E(Ty i?)/A is small). 
An alternative bound can be obtained comparing with a 7V(0,E(T^2))_(jig|.j.ii3^|.iQj^_ 

Corollary 2.9. In the situation of Corollary \2.8l let Zw denote the A^(0, E(VF^)) distribution. We 
obtain 

A \ 16 4 y A 

Proof. With (I2T3]) we get E{W^) = a'^{^{E{W - W'f + 2E(T^i?))). With the definition of T2 and 
T2, as in (|2.7p we obtain 
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Remark that now cr^ in (j2.10p is a parameter of the exchangeable-pair identity and no longer the 
parameter of the limiting distribution. We apply ()2.12p and exchange every cr^ in (|2.12p with E(VF^). 
Applying Cauchy-Schwarz to the first summand and bounding the other terms as in the proof of 
Corollary 12.81 leads to the result. □ 

3. Berry-Esseen bounds for the classical Curie- Weiss model 
Let Q be the symmetric Bernoulli measure and < /? < 1. Then 

W:=Wn := ^ VXi. 
converges m distribution to a iV(0,cr2) with a'^ = {I - (3)-'^ : 

Proof of Theorem \1.2[ We consider the usual construction of an exchangeable pair. We produce a spin 
collection X' = {X^)i>i via a Gibbs sampling procedure: select a coordinate, say i, at random and 
replace Xi by X'- drawn from the conditional distribution of the z'th coordinate given {Xj)j^i. Let 
/ be a random variable taking values 1, 2, . . . , n with equal probability, and independent of all other 
random variables. Consider 

w ■■=w-^ + ^ = ^y^x, + ^ 

^ hp ^ / Ti ^ / Ti ^ — ^ * / r) 

Hence (VF, W) is an exchangeable pair and 



I Xj — X't 

w-w' = - ^ 



Let T := o{X\, . . . , X^). Now we obtain 



- W'\T\ = l^-Y^^\Xi- X[\T\ = -W- f;E[X,'|^]. 

^ 1=1 ^ 1=1 



The conditional distribution at site i is given by 

exp(xi/?mj(x)) 



with 



It follows that 



exp(/3mj(j;)) + exp(— /3mj(x)) 

rriiix) := — x,-, i = 1, . . . , n. 
n ^-^ 



E[X,'|.F] =E[Xi|(X,),^^] =tanh(/?mi(X)). 

Now ^i^^^^tanh(/3m,(X)) = -^i ^^^^^ (tanh(/3m,(X)) - tanh(/?m(X))) + ^ tanh(/3m(X)) 
R\ + i?2 with m,{X) := ^ 'Y^=\ Xi- Taylor-expansion tanh(x) = x + 0{x^) leads to 



R 



2 



_^„(X) + _OW.Yf) = ^W' + 0(-^). 
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Hence 

E[W -W'\W] = ^—^W + R= + R (3.1) 

n 

with A := cj2 := (1 - and R := 0(K) - Ri- Since \W - W'\ = l^^^^l < =: A, we 
are able to apply Corollary 12.81 From Lemma 11.131 we know that for q being the symmetric Bernoulh 
distribution and for < /3 < 1 we have E(Ty^) < const.. Applying this it follows that the fourth 
term in (j2.1ip can be bounded by l.Svl^^^^-^^-^ < (L^^}^E£ii^ and the third summand in ()2.1ip can be 
estimated as follows: 



^^^—^^ const. A ^ 1 ^^^^^^^^ 

4 J yjn 



A V 16 

Moreover we obtain E|i?| < E|i2i| + 0{^^). Since tanh(3;) is 1-Lipschitz we obtain \Ri\ < 
-^\mi{X) — m[X)\ < Therefore, with Lemma [1.131 we get E|i?| = 0{—^) and thus, the 

second summand in (j2.1ip can be bounded by 



const. ( — + 1.5^ )^ = O(^). 



To bound the first summand in (I2.1ip . we obtain {W — W'^'^ = + ir- Hence 

E[(VF - W'f\T\ = --^^Xi tanh(/3m,(X)), 

and therefore 

1 1 " 

I- E[{W-W'f\J^] = - VX, tanh(/3m,(X)) 
2A ^ n ^-^ 

1=1 

1 " 

= - ^Xi(tanh(/3mi(X)) - tanh(/3m(X))) + m{X) tanh(/3m(X)) 

^ i=i 
=: Ri + i?2- 

By Taylor expansion we get R2 = + and using Lemma 11.131 we obtain E|i?2| = 0{n~^). 

Since tanh(x) is 1-Lipschitz we obtain < Hence E\Ri + i?2| = 0{n^^) and Theorem 11.21 is 
proved. □ 



Now we discuss the critical case /3 = 1, when g is the symmetric Bernoulli distribution. For /? = 1, 
using the Taylor expansion tanh(x) = x — x'^/3 + 0{x^), (13. ip would lead to 

1 

for some R. Hence it is no longer possible to apply Corollary 12.81 Moreover the prefactor A := ^ 
would give growing bounds. In other words, the criticality of the temperature value l//9c = 1 can 



18 



PETER EICHELSBACHER AND MATTHIAS LOWE 



also be recognized by Stein's method. We already know that at the critical value, the sum of the 
spin-variables has to be rescaled. Let us now define 

1 " 

^■■=^.E^^- (3-2) 

i=l 

Constructing the exchangeable pair (W, W') in the same manner as before we will obtain 

1 

E[W - W'\W] = — + R{W) =: -Xip{W) + R{W). (3.3) 

n'^/^ 3 

with A = and a reminder R{W) presented later. Considering the density p{x) = C exp(— x^/12), 
we have 

— - = '4}{x). 

p[x) 

This is the starting point for developing Stein's method for limiting distributions with a regular 
Lebesgue-density p{-) and an exchangeable pair (W, M^') which satisfies the condition 

E[W - W'lW] = -\ip(W) + R(W) = -A^^^ + R(W) 

p{W) 



1„- 1 1 " 



with < A < 1. To prove (|3.3p . observe that 

E[W-Ty'|VF] = -VT- tanhfrnifX)). 



n n^/^ n 

l=\ 

X,, 



By Taylor expansion and the identity mj(X) = — ;r obtain 

i=l 

with R{W) such that E|i2(P^)| = 0(n~^). The exact form of R(W) will be presented in Section 5. 
4. The exchangeable pair approach for distributional approximations 

Motivated by the classical Curie- Weiss model at the critical temperature, we will develop Stein's 
method with the help of exchangeable pairs as follows. For a rather large class of continuous distri- 
butions, the Stein characterization was introduced in [23], following the lines of \22\ Chapter 6]. The 
densities occurring as limit laws in models of statistical mechanics belong to this class. Let I be a real 
interval, where —oo<a<b<oo. A function is called regular if / is finite on / and, at any interior 
point of /, / possesses a right-hand limit and a left-hand limit. Further, / possesses a right-hand limit 
f{a+) at the point a and a left-hand limit f{b—) at the point b. 

Let us assume, that the regular density p satisfies the following condition: 

Assumption (D) Let p be a regular, strictly positive density on an interval I = [a, b]. Suppose p 
has a derivative p' that is regular on I and has only countably many sign changes and being continuous 
at the sign changes. Suppose moreover that fjp{x) \ log{p{x)) \ dx < oo and assume that 

i^ix) := ^ (4.1) 
p[x) 
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is regular. 



In \23\ Proposition] it is proved, that a random variable Z is distributed according to the density p 
if and only if 

E(/'(Z) + = f{b-)p{b-) - f{a+)p{a+) 

for a suitably chosen class J- of functions /. The proof is integration by parts. The corresponding 
Stein identity is 

nx)+^{x)f{x) = h{x)-P{h), (4.2) 

where /i is a measurable function for which jj \h{x)\p{x) dx < oo, P{x) := J^^p{y) dy and P{h) := 
Ji h{y)p{y) dy. The solution / := fh of this differential equation is given by 

p{x) 

For the function h(x) := l{x<z}{^) l^t fz be the corresponding solution of (j4.2p . We will make the 
following assumptions: 

Assumption (Bl) Let p be a density fulfilling Assumption (D). We assume that for any absolute 
continuous function h, the solution of (j4.2p satisfies 



IIAII < CiWh'l < C2\\h'\\ and < C3\\h'l 

where ci,C2 and C3 are constants. 

Assumption (B2) Let p be a density fulfilling Assumption (D) We assume that the solution fz of 

fz (x) + fz (x) = 1|,<,} (x) - P{z) (4.4) 

satisfies 

\fzix)\ < di, \fi{x)\ < d2 and If'zix) - fz{y)\ < ds 

and 

\i^i^{x)fz{x)y\ = \{^fz{x)y\<d, (4.5) 

for all real x and y, where di,d2, d^ and ^4 are constants. 



At first glance, Condition (j4.5p seem to be a rather strong or at least a rather technical condition. 

Remark 4.1. In the case of the normal approximation, ^(x) = — x, we have to bound {xfz{x))' for 
the solution fz of the classical Stein equation. But it is easy to observe that \{xf'z{x))'\ < 2 by direct 
calculation (see |6| Proof of Lemma 6.5]). However, in the normal approximation case, this bound 
would lead to a worse Berry-Esseen constant (compare Theorem 12.51 with Theorem 14. 6p . Hence in this 
case we only use d2 = d-^ = \ and di = \/27r/4. 



We will see, that for all distributions appearing as limit laws in our class of Curie- Weiss models, 
Condition ()4.5p can be proved: 
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Lemma 4.2. The densities fk,^i,i3 in (|l-8p and (|1.9p and the densities in Theorem \1.4[ Theorem \1.7\ 
and Theorem \1.8\ satisfy Assumptions (D), (Bl) and (B2). 

Proof. We defer the proofs to the appendix, since they only involve careful analysis. □ 

Remark 4.3. With respect to all densities which appear as limiting distributions in our theorems, 
we restrict ourselves to bound solutions (and its derivatives) of the corresponding Stein equation 
characterizing distributions with probability densities p of the form 6^ exp(— afcX^'^). Along the lines 
of the proof of Lemma 14.2^ one would be able to present good bounds (in the sense that Assumption 
(Bl) and (B2) are fulfilled) even for measures with a probability density of the form 

p(x) = 6feexp(-afcy(x)), (4.6) 

where V is even, twice continuously differentiable, unbounded above at infinity, V ^ Q and V and 
l/V are increasing on [0, co). Moreover one has to assume that j ^,^^^^-*! can be bounded by a constant 
for X > d with some d G We sketch the proof in the appendix. It is remarkable, that this class of 
measures is a subclass of measures which are GHS, see Section 7. A measure with density p in (j4.6p is 
usually called a Gibbs measure. Stein's method for discrete Gibbs measures is developed in [8j. Our 
remark might be of use applying Stein's method for some continuous Gibbs measure approximation. 

Remark 4.4. In the case of comparing with an exponential distribution with parameter it is easy 
to see, that Assumption (D) and (B2) is fulfilled, see }23l Example 1.6] for (D) and [5, Lemma 2.1] 
for (B2). We have ^{x) = -/i and \\f,\\ < 1, ||/^|| < 1 and sup,,j^>o |/^(:e) - f',{y)\ < 1. Thus 

\{^{x)Mx)y\ = fi\fi{x)\ < ^^. 

Remark 4.5. From (j4.3p we obtain 

{l-P{z))P{x) 

Jz[x) = 1—. for x < z 

p(x) 



and 



P(z)(l-P(x)) 

fz[x) = — for x>z. 

p[x) 



Hence 

P{x)p'{x) p 



and 



ip{x) f;,{x) = — {1-P{z)) for x<z 

p^(x) 

{1-P{x))p'{x) ^^^ 

Jz{x) = [P[z)) for x> z. 

p'^[x) 



Therefore one has to bound the derivative of 

P{x)p'{x) {I- P{x))p'{x) 
p^{x) p'^{x) 

respectively, to check Condition ()4.5p . 



The following result is a refinement of Stein's result [22] for exchangeable pairs. 
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Theorem 4.6. Let p be a density fulfilling Assumption (D). Let {W,W') he an exchangeable pair of 
real-valued random variables such that 

¥.[W'\W] = W + Xip{W)- R{W) (4.7) 

for some random variable R = R{W), < A < 1 and tp defined in (j4.ip . Then 

E{W - W'f = -2XE[W^p{W)] + 2E[W R{W)]. (4.8) 

We obtain the following assertions: 

(1) Let Z be a random variable distributed according to p. Under Assumption (Bl), for any 
uniformly Lipschitz function h, we obtain 

\Eh{W) -Eh{Z)\ < 5\\h'\\ 

with 



C2E 



I - —E[{W -W'f\W) 



1 , "'/^2,w^ +f3jg|^_^,|3^^ /^T^_ 

4A A 



(2) Let Z be a random variable distributed according to p. Under Assumption (B2), we obtain for 
any ^ > 



dK{W,Z) < d2^jE[l-^E[{W> -WY\W]^ + (di + ^A) 



A 



With (US]) we obtain 

e(^- ^^[{W - W'f\W\^ = 1 + E[W^{W)] - ^(Ej^. 

Therefore the bounds in Theorem 14.61 are unhkely to be useful unless — E[Ty^/>(Ty)] is close to 1 and 
^^^^^ is small. Alternatively bounds can be obtained comparing not with a distribution given by 
p but with a modification which involves E[VF^/'(14/^)]. Let pw be a probability density such that a 
random variable Z is distributed according to pw if and only if 

E(E[Wi^{W)] f'{Z) + V(Z) f{Z)) = 

for a suitably chosen class of functions. 

Theorem 4.7. Let p be a density fulfilling Assumption (D). Let {W,W') be an exchangeable pair of 
real-valued random variables such that ()4.7p holds. If Zy/ is a random variable distributed according 
topw, we obtain under (Bl), for any uniformly Lipschitz function h that \Eh{W) —Eh{Zw)\ < (^'H/i'H 
with 

5' := ^(Var(E[(W^ - W'f\W])f'^ + ?,nW - W'\' + "^ + "^f^^ V^. 
2A 4A A 
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Under Assumption (B2) we obtain for any ^4 > 



dK{W,Zw) < ^{\^T{n{W-W'f\W]f'^+{di + d2^/^W^) + \A)y^ 



(4.10) 



a(^) + T^^— - ' 2A- 
Proof of Theorem \4-(^ Interestingly enough, the proof is a quite simple adaption of the results in [22] 
and follows the lines of the proof of Theorem 12.51 For a function / with |/(x)| < C(l + |x|) we obtain 

= ^{W -W'){f{W') + f{W))) 

= mw -W'){f{W')- f{W)))-2X&{4,{W)f{W)) + TK{f{W)R{W)), (4.11) 
which is equivalent to 

n^{W) f{W)) = -^H{W - W'){f{W) - f{W'))) + jnf{W) R{W)) (4.12) 

Proof of (1): Now let f = fh the solution of the Stein equation (j4.2p . and define 

K{t) := {W - T4^')(l{-(VF-iy')<t<o} ~ l{o<t<-(H/-w/')}) - ^• 
By (j4.12p . following the calculations on page 21 in [6j, we simply obtain 

\Eh{W) -Eh{Z)\ = \E{f'{W) + ib{W) f{W))\ 

= \E{f'{W) (^l-l-iW- W'f^ + (^(/'(^^) - nW + t)) Kit) dt 

+ jE{fiW)R{W))\. 
Using Jjjj \t\K{t) dt = ^E\W — W'\^, the bounds in Assumption (Bl) give: 



\Eh{W) - Eh{Z)\ < \\h'\\(c2E 



1- ^E{{W -W')^\W) +^E\W -W'\^ + ^^/E{^\ (4.13) 
2A 4A A J 

Proof of (2): Now let f = f^he the solution of the Stein equation ([33]). As in ([22]), using KUh . 
we obtain 

PiW<z)-P{z) = E{f'{W) +i^{W)f{W)) 

= E(^f{W){l - ^{W - W'f)'^ + ^E{2f{W) R) 

{W - W'){f{W) - f{W') -{W- W')f'{W)) 
= T1+T2 + T3. 
Now the bounds in Assumption (B2) give 



and 



iTil < d2\lE[ 1 - jE[{W' - Wy\W] 
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Using the decomposition (j2.8p of (— 2A) T3, the modulus of the first term can be bounded by (is E((M/^ — 
VF')^l|Hy_vi/'|>A}) • Using the Stein identity (j4.4p . the second summand can be represented as 

e((W - W')l{\w-W'\<A} f {-4'{W + t) f{W + t) + ij{W) f{W))dt] 

V J-{w-w') J 

+e((M/ - H^')l{|H/-iy'|<A} / {\w+t<z} - \w<-A)dt \ =■■ Ui + U2. 

With g{x) := {ip{x)f{x)y we obtain 

-iIj{W + t) f{W + t)+ i){W) f{W) = - [ g{W + s) ds. 

Jo 

Since \g{x)\ < d/^ we obtain < ^d^. 

Analogously to the steps in the proof of Theorem 12.51 U2 can be bounded by 

E((Ty - W'){f{W) - f{W'))) = 2E(/(Ty) R{W)) - 2AE(V'W f{W)) , 

where we applied (j4.12p . and where / is defined by f{x) := —l.bA for x < z — A, f{x) := 1.5^4 for 
x> z + 2Aand f{x) := x - z - A/2 in between. Thus U2 < 3A{E{\R\) + XE{\ip(W)\)) . Similarly we 
obtain U2 > -3^(E(|i?|) + XE{\iP{W)\)) . □ 

Proof of Theorem \4. 7\ The main observation is the following identity: 

iE(-E[n'v,(n')] /'(HO +*(»')/(»')) = E (/'(ty) ( ^''"' ~ ~ ) +E(m')fm) 

with T3 defined as in the proof of Theorem 14.61 Now we can apply the Cauchy-Schwarz inequality to 
get 

E\E[{W - W'f] - E[{W - W'f\W]\ < (Var(E[(H^ - W'f\W]) f^^. 
Now the proof follows the lines of the proof of Theorem 14.61 □ 

Remark 4.8. We discuss an alternative bound in Theorem 14.61 in the case that {'ip{x)fz{x)y cannot 
be bounded uniformly. By the mean value theorem we obtain in general 



-il^iW + t)f{W + t) + ij{W)f{W) = il^iW) (- / f'{W + st)tds) + f{W + t){- / ^p'{W + st)tds) . 

Jo Jo 

This gives 

\-ij{w + t)f{w + t) + ^{w)f{w)\<d2\ij{w)\\t\+di [ \ij'{w + st)\\t\ds. 

Jo 

Now we get the bound 

_L|C,|<^E(|*W|) + |LE(y) 



with ^ ^ 

V := (\W - W'\ l{iw-W'\<A} I I W{W + st)\ \t\ds dt] . 

\ J-iW-W')Jo J 
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Let US consider the example ^p{x) = —x^/3. Now 

ip'{W + st)\ = \{W + stf\ = \W'^ + 2stW + s'^t'^\, 

hence 

\{W + st)'^\\t\ < \t\\W^\ + |t2|2|VF||s| + It^lls^l 

and integration over s gives 

/ \ip'{W + st)\\t\ds < \t\\W^\ + \t^\\W\ + \t^\/3. 
Jo 

Integration over t leads to 



r i^\^l^'{w+st)\\t\d. 

j-iw-w) Jo 
with A := {W — W). Hence we get 



■s)<^\W'\ + ^-^\W\ + \!^ 



/43 aA 4 5 

E(V^) < — E|T^2| _^ — £1^1 _^ — 
y 2 3 12 

We will see in Section 5, that this bound is good enough for an alternative proof of Theorem 11.31 



5. Berry-Esseen bound at the critical temperature 



Proof of Theorem \1.3[ We start with (j3.3p , where W is given by (j3.2p . We will calculate the remainder 
term R{W) more carefully: By Taylor expansion and the identities mi{X) = m(X) — Xi/n and 
m{X) = ■:^^W we obtain 

1 1 " 11 ^A/^ w 

1=1 

with 



S{W) = 




From Lemma 11.131 we know that for g being the symmetric Bernoulli distribution and /3 = 1 we get 
E|VF|^ < const.. Using this we get the exchangeable pair identity 1^ with R{W) = C)(;^). With 
Lemma [4.2l we can now apply Theorem l4.6l using 

l^-^'l <^=-A- We obtain l.5AE{\ij{W)\) < 
const. ^ and^ = ^^. Using E\R{W)\ < const. ^ we get 

, 3 ^,E\R(W)\ 1 
(d, + -A)^^< const.-. 

Moreover we obtain 

E[iW-Wr\:F]=-^--^^j2x,tanHm,iX)). 
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Hence applying Theorem 14.61 we have to bound the expectation of 

1 

T := |-^Xitanh(mi(X))|. 



' n 

i=l 



Again using Taylor and mi{X) = m{X) — ^ and Lemma 11.131 the leading term of T is ^j2- Hence 
E(r) = 0{n^^/^) and Theorem [O] is proved. □ 

Remark 5.1. In Remark 14.81 presented an alternative bound via Stein's method without proving 
a uniform bound for {if:' {x) f z{x))' . As we can see, the additional terms in this bound are of smaller 
order than 0{n~^/'^), using A = n^^/^. 

Proof of Theorem \1.4\ (1) Let /5„ — 1 = and W = Sn/n^^'^- For the distribution function in 
Theorem 11.41 we obtain ^|)[x) = j x — ix^. Moreover we have 



E[W - W'\W] = f^W + + R{(3n, W) (5.1) 

ft Tl > o 

with R{Pn,W) = 0{n-^). With /3„ - 1 = ^ we obtain 

E[W - W'\W] = -^W + + R{Pn, W) = -^i^iW) + R{(3n, W) 

with R{f3n,W) = 0{n~'^). Now we only have to adapt the proof of Theorem 11.31 step by step, using, 
that the sixth moment of W is bounded for varying /3n, see Lemma 11.131 Hence by Lemma 14.21 and 
Theorem 14.61 part (1) is proved. 

(2): we consider the case - 1| = ©(n^^) and W = 5„/n^/*^. Now in (fSTj) . the term ^^^W will 
be a part of the remainder: 

E[w - W'\W] = J^— + R{p^, W) + =: -^HW) + R{Pn, W) 

with := x^/3. Along the lines of the proof of Theorem ll.3l we have to bound ^^y^ with A = -;^j2- 
But since by assumption 

1 (1 - (in) - Pn) „ 

hm = -3 = 0, 

applying Theorem 14.61 we obtain the convergence in distribution for any /?„ with — 1| <C n~^/^, 
and we obtain the Berry-Esseen bound of order 0{l/^/n) for any — 1| = 0{n~^). 



(3) Finally we consider - 1| > n~^/^ andW = ^J ^^-^S*™. Now we obtain 

^ — R B W B^ W'^ 

E[W - W'\W] = ^^W + ^ + — + W) =: -XiPiW) + R{/3n, W) 

Tl Tl Tl {1. Ptl ) *J 

with A = and Tp{x) = -x. We apply Corollary ES) with A = -^(1 - (3n)^^^, one obtains 

A-M3 = n-i/2(l - /3„)V2 and 

E\R{l3n,W)\ ^ const 



n(l - PnY 
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Moreover 



Hence 



E[iW - W'fm = ^^^^ - ^^^/"^ -E^^ tanh(/3„m,(X)). 



1-1.E[{W-W')^\W] 



n 



W^- — 



n n 



i=l 



n n2(l-/?„)2 3 



+ RiPn,W) 



0[ 



/3n 



)• 



Hence with |/3„ — 1| S> n ^^'^ we obtain convergence in distribution. Under the additional assumption 
|/3„ — 1| ^ n^^/^ we obtain the Berry-Esseen result. □ 



6. Proof of the general case 



Proof of Theorem \1. 7| Given q which satisfies the GHS-inequality and let a be the global minimum 
of type k and strength ^(a) of Gg. In case = 1 it is known that the random variable converges 
in distribution to a normal distribution N{0,a'^) with cj^ = ^(a)~^ — (3~^ = (cr"^ — see for 

example [H)\ V.13.15]. Hence in this case we will apply Corollarv 12.91 (to obtain better constants for 
our Berry-Esseen bound in comparison to Theorem 14. 7p . 

Consider k > 1. We just treat the case a = and denote ^ = /i(0). The more general case can 
be done analogously. For A; = 1, we consider tp{x) = —-^ with o"^ = — j3~^. For any A; > 2 we 
consider 



We define 

1 " 

i=l 

and W , constructed as in Section 3, such that 

, Xt — X'j 
W - W = — 

^l-l/(2fc) • 

We obtain 

1 1 1 " 

nW - W'm = -W- ^^TTTTMn 

i=l 

Now we have to calculate the conditional distribution at site i in the general case: 



Lemma 6.1. In the situation of Theorem \1.T[ if Xi is g-a.s. bounded, we obtain 

E{Xl\J^) = (m,(X) - ^G'g{l3,m,{X))) (l + 0{l/n)) 

with m.(X) := i Zj^^Xj = mix) - 
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Proof. We compute the conditional density g/3{xi\{Xi)i>2) of Xi = xi given (Xj)j>2 under the Curie- 
Weiss measure: 



gp{xi\{Xi)i>2) ■■ 



e/3/2n(E,>2^i^«+^D 



Je/3/2n(E,>2^'i^.+^'?)^(d2;i) 
Hence we can compute E[X(|^] as 

j-^^e/3/2n(E.>2-l^.+-?)^(dxi 



m[\:F] 



J-e/3/2n(E.>2-i^.+-?)^(rf2, 



Now, if < c ^)-a.s 



and 



Je/3/2n(E.>2-i^.+-?)^(d2;^)e-/3cV2n 



J e/3/2n.(E,>2 ^i^«+^?)^(d2;i)e/3^'/2'^ ' 
By computation of the derivative of Gg we see that 



e^M_,±PcVn ^ _ 1 G;(/3,mi(X))) (1 ± /?cVn). 



Je/3/2n(E,>2-i^.+-?)^(rf^^) ^ ^ /? 



□ 



Remark 6.2. If we consider the Curie-Weiss model with respect to Pn,i3, the conditional density 
gp{xi\{Xi)i>2) under this measure becomes 

p/3/2n(E,>2^i^«) 



gp{xi\{X, 



i i>2 



j-e/3/2n(E,>2^i^0^(^2;i)' 



Thus we obtain K{Xl\J^) = (mj(X) — without the boundedness assumption for the 

Applying Lemma l6.ll and the presentation (jl.Sp of Gg, it follows that 

With mj(X) = m{X) - ^ and m{X) = :^jj(2k)W we obtain 

1 1 " 1 1 



and 



^i-i/(2fc) ^ z — / ' ' n n 

i=l 



2fe-l 



„i-i/(2fc)„^y5(2A;-l)! ' ' ni-i/(2fc) /3(2A; 



z=o ^ ^ i=l 
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For any A; > 1 the first summand {I = 0) is 



1 TT,2fc-l 1 



„.-i,(2.-i„»'"-'-;;;rj«H'). (o.i) 

To see this, let k = I. Since we set 4'"{0) = 1, we obtain /i(0) = [3 — 0^ and therefore ^^{^)W = 
(1 — (3)W . In the case k >2 we know that /3 = 1. Hence in both cases, ()6.ip is checked. Summarizing 
we obtain for any k > 1 

E[W - W'\W] = ^il^{W) + R{W) =: -XipiW) + R{W) 



with 



With Lemma 11.131 we know that Ejl^/'p'^ < const. We will apply Corollary 12.91 if A; = 1 and Theorem 
14.71 for k >2. In both cases we apply Lemma 14.21 Since the spin variables are assumed to be bounded 
^>-a.s, we have 

n-^ 2fc 

Let k = 1. Now X = ^, A = const. /n^^/^, E(M^^) < const. The leading term of R is W/n'^. Hence 
the last four summands in (j2.14p of Corollary 12.91 are 0{n~^^'^). 

For A; > 2 we obtain ^E(|V'(VF)|) = ©(n^"^) and \ {^^) = 0{n-h~^). The leading term in the 
second term of R{W) is the first summand {I = 1), which is of order C'(n~'^^fc"'"2fc ). With A = n""^ 
we obtain 

!!M)<!E(lE!)^o(„A-) and !!aEn=o(„.A). 

A An^ An^ ^ 

Hence the last four summands in (j4.10p of Theorem 14.71 are 0{n~'^^^). 

Finally we have to consider the variance oi ^¥.[{W -W'f\W]. Hence we have to bound the variance 

of 

m n n , . 

^ E + ^ E ^[(^0 V] ^-Y^xAmiiX)- -G'^iP, m,{X)) (1 + 0(l/n)). (6.2) 

i=l 1=1 i=l ^ ^ ^ 

Since we assume that g G GHS, we can apply the correlation-inequality due to Lebowitz (see Remark 

KiXiXjXkXi) -E{Xaj)EiXkXi) -E{XiXk)nXjXi) -nXiXiMXjXk) < 0. 
The choice i = k and j = I leads to the bound 

Cov{Xf,X]) = E{XfXf)-E{Xf)E{X]) < 2{E{XiXj)f. 
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With Lemma n. 131 we know that (E(XjXj))^ < const. n This gives 

n n 
i=l i=l l<*<i<" 

Using a conditional version of Jensen's inequahty we have 

Var(E(:^E^^'l^))^Var(i-f;X.^). 



'2n^ M - \2n 

i=l i=l 



Hence the variance of the second term in (|6.2|) is of the same order as the variance of the first term. 
Applying (|1.5|) for Gq, the variance of the third term in (|6.2|) is of the order of the variance of W'^/n^^'^. 
Summarizing the variance of (j6.2p can be bounded by 9 times the maximum of the variances of the 
three terms in (j6.2p . which is a constant times n~'^/^ , and therefore for A; > 1 we obtain 

\ai(^^mW -W'f\W]^^ ' =0{n-^'^). 
Note that for k>2 

-¥.[W^Ij{W)] ~ ~E(Ty2fc)- 
Hence we compare the distribution of W with a distribution with Lebesgue-probability density pro- 
portional to exp(- 2^^^^) . □ 

Proof of Theorem. \1.8[ Since a = and k = 1 for (3^1 while a = and k > 2 for (3 = 1, Gq{-) can 
now be expanded as 

= G(0) + + ^.^'^ + as .^0. 



Hence ^ G'^is) = ^s+ s„{2k-i)\ ^'^'''^ + ^(s^'')- With LemmaOand fJ-i = (l - ^„)/3„ we obt. 



We get 



/3„(2fc-l) 

E[X,\J-] = (3^m,{X) - .^-^J^—^m,{Xr~' (1 + 0(l/n)). 



am 



The remainder R{Pn, is the remainder in the proof of Theorem 11.71 with fi exchanged by fik and (3 
exchanged by 

Let /3n - 1 = and W = n^/^^fc)-! ^n^^ q^^^^-j^ 

E[VF - W'\W] = ^HW) + + i2(/3„, W^), (6.3) 

where ip{x) = ^x— ^ (-zk-iy. ^"^^'^ • ™ proof of Theorem 1 1.71 we obtain that R{Pn, W) = 0{n~^). 
Now we only have to adapt the proof of Theorem 1 1 . 71 step by step, applying Lemma [1.131 Lemma [42] 
and Theorem 14.71 
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Let \Pn - 1| = 0{l/n) and W = n^/(^'')-^ J2i=i ^i- Now in the term will be a part of 

the remainder: 

with il){x) = — ^2k'^i)\ x^'^^^- Following the lines of the proof of Theorem 11.71 we have to bound 
¥.\R{0ri,w)\ ^j^j^ ^ ._ — . Since by our assumption for (/5„)n we have 



lim i^^^=/3„(l-/3nK-^/'= = 0. 
71-+00 A n 



Thus with Theorem 14.71 we obtain convergence in distribution for any /3„ with — 1| ^ n ^/^), 
Moreover we obtain the Berry-Esseen bound of order 0{n~^/^) for any — 1| = 0{n^'^). 



Finally we consider — 1| ^ n ^/^^ and W = \J ^"—^^Sn- A little calculation gives 

with ip{x) = —X and A = ^~J^" ■ Now we apply Corollarv l2.9[ With A := '^""^^'^^^"^ we obtain 

const.(l-/3n)^/^ , E\R(/3n,W)\ const 
< = and : < 



X - A - nfc-i(l-/3n)'=' 

Remark that the bound on the right hand side is good for any — 1| ^ n~^^~^^''\ Finally we have 
to bound the variance of ^E[(VF - The leading term is the variance of 



^p^X,(m,iX)-^G'^{P,m,iX))y 



which is of order Hence with — 1| ^ n ^^^^ we get convergence in distribution. 

Under the additional assumption that — 1| S> n^^^/'^^^/^'^''^^ we obtain the Berry-Esseen bound. □ 

Proof of Theorem \1.1(A We apply Theorem 14. 7[ For unbounded spin variables Xj we consider 
and apply Lemma [6. II to bound ^Y^Var(E[(H^ — Ty)2|VF]) exactly as in the proof of Theorem II .71 By 
Theorem 14.71 it remains to bound ^IE|VF — W'f. With A = we have 

\m -wf = ^^,nxi -x'j\' = - x[\\ 

Now E\Xi-X[\^ < E\Xi\^ + 3E\XfX[\+3E\Xi{X[)^\+E\X[\^. Using Holder's inequality we obtain 
E\XfX[\ < {E\Xiff^^ {E\X[fY^^ < max(E|Xl|^E|X[|3). 

Hence we have 

hE\W -Wl"" < -^^max{E\Xif,E\X[f). 
Thus the Theorem is proved. □ 
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7. Examples 

It is known that the following distributions g are GHS (see [11, Theorem 1.2]). The symmetric 
Bernoulli measure is GHS, first noted in [9]. The family of measures 

Qa{dx) =a6x + ((1 - a)/2) {S^-i + S^+i) 

for < a < 2/3 is GHS, whereas the GHS-inequality fails for 2/3 < a < 1, see [211 p.l53]. GHS 
contains all measures of the form 



Qv{dx) '■= { exp(— y(x)) dx) ^ exp(^—V{x)) dx, 
Jr 



where V is even, continuously differentiable, and unbounded above at infinity, and V is convex on 
[0,oo). GHS contains all absolutely continuous measures g £ B with support on [—a, a] for some 
< a < oo provided g{x) = dg/dx is continuously differentiable and strictly positive on (—a, a) 
and g'{x)/g{x) is concave on [0, a). Measures like g{dx) = const. exp(—ax'^ — hx"^) dx or g{dx) = 
const. exp(— acoshx — bx^) dx with a > and b real are GHS. Both are of physical interest, see [llj 
and references therein). 

Example 7.1 (A Curie- Weiss model with three states). We will now consider the next simplest 

example of the classical Curie-Weiss model: a model with three states. Observe, that this is not the 

Curie- Weiss-Potts model [15], since the latter has a different Hamiltonian. Indeed the Hamiltonian 

considered in [15] is of the form ^ ■ Sx^^Xj ■ It favours states with many equal spins, whereas in our 

case the spins also need to have large values. We choose g to be 

2 1 1 
= g'^o + + e'^v^- 

This model seems to be of physical relevance. It is studied in [24]. In [3] it was used to analyze the 
tri-critical point of liquid helium. A little computation shows that 

d^ g sinh(3; \/3) \/3 (cosh(x \/3) — 1) ^ ^ 

ds^ ~ 12 cosh(x ^/3) + 6 cosh(x ^/3)2 + cosh(x ^/3)3 + 8 ~ 

for all s > 0. Hence the GHS-inequality (|1.10|) is fulfilled (see also |1H Theorem 1.2]), which implies 
that there is one critical temperature (5c such that there is one minimum of G for [5 < Pc and two 
minima above [3c- Since Var^(Xi) = 2^ • 3 = 1 we see that /3c = 1. For (3 < j3c the minimum of G is 
located in zero while for /5 > 1 the two minima are symmetric and satisfy 

_ V3smh{V3(3s) 
~ 2 + cosh(^/3/3s ) ' 

Now Theorem O and [TH] tell that 

• For (3 < 1 the rescaled magnetization 5„/-y/n satisfies a Central Limit Theorem and the limiting 
variance is (1 — Indeed, ^(/>p(0) = Varg(Xi) = 1. Hence fix = (3 — 0^ and cr^ = 

Moreover we obtain 

Pni^<z)-^W{Z) 



sup 



n 



< ^ 
'n 
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For /? = /3c = 1 the rescaled magnetization Sn/n^^^ converges in distribution to X which has 
the density /s^e.i- Indeed ^2 is computed to be 6. Moreover we obtain 



sup 



P. 



5/6 



n 



< 



c 

"T73 



n 



where the derivative of -F3 is the rescaled density exp ( 



6E(iy«) 



)• 



If I3n converges monotonically to 1 faster than n~^/^ then converges in distribution to F3, 
whereas if j3n converges monotonically to 1 slower than n~^/^ then satisfies a Central 

Limit Theorem. Eventually, if |1 — Pn\ = jn~'^^^, -^f^ converges in distribution to a random 
variable which probability distribution has the mixed Lebesgue-density 

exp -Cur 77- -7— 



with cw = TioHW^) - 7E(VF^ 



Moreover we have 



sup 



Pr, 



n 



5/6 



< z 



1 



exp 



-w 



120 



< 



n 



C 



Example 7.2 (A continuous Curie- Weiss model). Last but not least we will treat an example of a 
continuous Curie-Weiss model. We choose as underlying distribution the uniform distribution on an 
interval in M. To keep the critical temperature one we define 

—, = 7-I[-a,a] 

axi 2a ^ ' 

with a = \/3. Then from a general result in |13l Theorem 2.4] (see also [11^ Theorem 1.2]) it follows 
that g{xi) obeys the GHS-inequality (ll.lOp . Therefore there exists a critical temperature (3c, such that 
for (3 < (3c zero is the unique global minimum of G and is of type 1 , while at (3c this minimum is of type 
k>2. This (3c is easily computed to be one. Indeed, ^ll = (3 - (3'^(()"{0) = (3 - (3'^Ef,{Xf) = (3{1 - (3), 
since g is centered and has variance one. Thus ni vanishes at (3 = (3c = 1- Eventually for (3 > 1 there 
are again two minima which are solutions of 



tanh(\/3/3x) 



1 

(3x + -. 

X 



Now again by Theorems 11.71 and 11.81 



• For P < 1 the rescaled magnetization Sn/y/n obeys a Central Limit Theorem and the limiting 
variance is (1 — /3)~^. Indeed, since 'Eg{Xf) = 1, fii = (3 — P"^ and a"^ = jz^- 

• For /? = /3c = 1 the rescaled magnetization Sn/n^^^ converges in distribution to X which has 
the density /4,6/5,i- Indeed /i2 is computed to be 



Moreover we obtain 



Sn 



+ 3 



sup 



p 



n 



7/8 



< Z 



F,{z) 



< 



n 



- ^ 
" 5' 

C 

T74 
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where the derivative of F4 is the rescaled density exp(— gj^^^?^) . 

If Pn converges monotonically to 1 faster than n~^/^ then ^^^g converges in distribution to F4, 
whereas if /3„ converges monotonically to 1 slower than n~^/^ then satisfies a Central 

Limit Theorem. Eventually, if |1 — /3„| = 7?^"^/^, ^^^g converges in distribution to the mixed 
density 



C 

< 



with cw = 7r^E{W^) - 7E(M/^). Moreover we have 



!) 

sup 



ni/4- 

Note that there is some interesting change in limiting behaviour of all of these models at criticality. 
While for /3 < 1 all of the models have the same rate of convergence for the Central Limit Theorem 
behaviour, in the limit at criticality the limiting distribution function as well as the distributions which 
depend on some moments of W becomes characteristic of the underlying distribution g. Moreover the 
rate of convergence differs at criticality (for k > 3). 



8. Appendix 



.1) 



Proof of Lemma \4-^ Consider a probability density of the form 

p{x) ■.= pk{x) := bkexp{-akX^'') 

with bk = /jg exp(— OfcX^'^) dx. Clearly p satisfies Assumption (D). First we prove that the solutions fz 
of the Stein equation, which characterizes the distribution with respect to the density (|8.ip . satisfies 
Assumption (B2). Let fz be the solution of 

f'zix) + ij{x)fz{x) = l{.<z}{x) - P{Z). 

Here 'il^{x) = —2kakx'^''^^. We have 



(1 - P{z)) P{x) exp(afcX^*^)6^, ^ for x < z, 
P{z) (1 - P{x)) exp(afcX^'')6^^ for x > z 



(8.2) 



with P{z) := J^^p{x) dx. Note that fz{x) = f-z{—x), so we need only to consider the case z > 0. 
For X > we obtain 



1 - P{x) < 



whereas for x < we have 



By partial integration we have 



2k akx"^^' 
bk 



-exp(-afcx^ ) 



^^^^ ^ ^, 7i2k 1 exp(-afcx^^). 

2kak\xr''~^ ^ 



(8.3) 
(8.4) 



2k Uk 



2k ak 



( 2k\ 

-^exp{-akt } 



00 ^00 



X Jx 



exp(-afct ) dt 
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Hence for any x > 



X 



exp(-afcx2'=) < 1 - P( 



x). 



With dOD we get for x > 

— f exp(afcx^'') y exp(-aA;t^'') dt \ = -1 + IkakX^^'^ exp(afcx^'') j exp{-akt'^^) dt < 0. 
So exp^OfcX^'^) exp(— Ofct^'^) (it attains its maximum at x = and therefore 

r-co ^ 

exp{akx'^^) bk J exp{-akt'^'') dt < -. 
Summarizing we obtain for x > 

'1 bk 



(8.5) 



1 — P(x) < min 



2 ' 2k ak x^ 



2k^ exp^-afex 



2k \ 



(8.6) 



With (1831) we get for x < 

dx 



afcX 



exp(ofcX^'') / eyip{-akt'^'') dt\ = l + 2kakx'^'' ^ exp 
So exp(afcX^'^') exp(— a^t^'^) attains its maximum at x = and therefore 

/X 2 
exp(-afct^'') dt < -. 
-oo ^ 

Summarizing we obtain for x < 

h 



/X 
exp(-afct^''') dt > 0. 
-oo 



P{x) < mini — , 



1 



2 2A; ttfc |x 



2fc- 



— ^ exp(-afcx^''). 



(8.7) 



Applying (j8.6p and (j8.7p gives < /^(x) < 2^ for all x. Note that for x < we only have to 
consider the first case of (|8.2p . since 2; > 0. The constant 2^ is not optimal. Following the proof of 
Lemma 2.2 in |6] or alternatively of Lemma 2 in |22[ Lecture II] would lead to optimal constants. We 
omit this. It follows from (18.21) that 



(l-Piz)) 



Ij^x'^k 1 2A; Ofc P(x) exp(afcX^'^)6 



P{z) 



{1 — P{x))2k akX^^ ^ exp(aA;X 



for X < z, 
for X > z. 



With dOD we obtain for < x < z that 
/^(x)<(l-P(z)) 



2k akPiz)exp{akZ^^)b 



2k\u-l 

k 



+ 1 < 2. 



The same argument for x > z leads to |/^(x)| < 2. For x < we use the first half of (|8.2p and 
apply ()8.4p to obtain |/^(a;)| < 2. Actually this bound will be improved later. Next we calculate the 
derivative of — ^/^(x) /^(x): 



(-V'(x)/,(x))' 



bk 



EM. 



P(x)e"fc^'' {2k{2k - l)afex2'=-2 + (2kf alx^^-"^] + 2A;afca 



■''-'bk 



[1 - P(x))e'*fe^'' ( 2A;(2A; - l)afcx2'=-2 + {2kfalx^'-' ] - 2kakx'^''-% 



x < z, 
X > z. 
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With (j8.5p we obtain {—tjj{x)fz{x))' > 0, so —tp{x)fz{x) is an increasing function of x (remark that 

for X < we only have to consider the first half of (j8.2p ). Moreover with (j8.3p . (|8.4p and (jS.Sp we 
obtain that 

Yiia 2kakx'^^~^f\{x)=P{z)-l and Mm 2k x^^^^ f ^{x) = P {z) . (8.10) 

X — > — OO X — >oo 

Hence we have \2k x"^^""^ f z{x)\ < 1 and \2k ak{x'^^~^ fz{x) — u'^^^^ fz{u)^\ < 1 for any x and u. 
From (|8.3p it follows that /^(x) > for all x < z and /^(x) < for x > z. With Stein's identity 
/^(x) = -^(x)/,(x) + 1|^<^} - P(z) and ([5101) we have 

< f'zix) <-ij{z)fziz) + l- P{z) <1 for x<z 

and 

-1 < -'ip{z)fziz) - P{z) < /^(x) < for x> z. 
Hence, for any x and y, we obtain 

|/;(x)| < 1 and \f-Ax) - fM\ < max(l, -^{z)fz{z) + 1 - P{z) - {-^{z)fz{z) - P{z))) = 1. 

Next we bound {—ip{x)fz{x)y. We already know that {—ip{x)fz{x))' > 0. Again we apply (j8.3p and 
to see that 

(-V(x)/,(x))'< ^^"^ 



|x| 

for X > z > and all x < 0. For < x < z this latter bound holds, as can be seen by applying 
this bound (more precisely the bound for {—tl){x)fz{x)y for x > z) with — x for x to the formula 
for {ip{x)fz{x)y in X < z. For some constant c we can bound {'4){x)fz{x)y by c for all |x| > . 
Moreover, on [— '^^~^ ] the continuous function {—il){x)fz{x)y is bounded by some constant d, 
hence we have proved 

I - {'il){x)fz{x)y\ < max(c,d). 

The problem of finding the optimal constant, depending on k, is omitted. Summarizing, Assumption 
(B2) is fulfilled for p with (i2 = da = 1 and some constants di and ^4. 

Next we consider an absolutely continuous function /i : M ^ M. Let fh be the solution of the Stein 
equation ()4.2p . that is 

A(x) = ^y {h{t)-Ph)p{t)dt=--^ J {h{t)-ph)p{t)dt. 

We adapt the proof of [6l Lemma 2.3]: without loss of generality we assume that /i(0) = and put 
Co := supa; \h{x) — Ph\ and ei := sup^, |/i'(x)|. Form the definition of fh it follows that \ fh{x)\ < gq-^- 
An alternative bound is ci ei with some constant ci depending on E|Z|, where Z denotes a random 
variable distributed according to p. With (j4.2p and (j8.5p . for x > 0, 

POO 

\f'hix)\ < Hx) - Ph\ - V(^)e"'=^'' / \h{t) - Ph\e-'"^''' dt < 2eo. 
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An alternative bound is C2 ei with some constant C2 depending on the {2k — 2)'th moment of p. This 
is using Stein's identity (j4.2p to obtain 



(/i'(t)-V'(t)/(t))e-"^*"(it 



The details are omit. To bound the second derivative we differentiate ()4.2p and have 

fl{x) = {^P\x) - ^P'{x))^{x) - ^ix){hix) - Ph) + h\x). 
Similarly to [6, (8.8), (8.9)] we obtain 

/X poo 
h'{t)P{t)dt- h'{t){l- P{t))dt. 
-OO J X 

It follows that 

A(x) = --e"'=-'"(l-P(x)) / h'{t)P{t)dt--e--^' P{x) / h'{t){l-P{t)) 

Ok J-oo Jx 

Now we apply the fact that the quantity in (j8.9p is non-negative to obtain 



dt. 



\f;{x)\ < \h'ix)\ + \{ijHx)-i^'ix))Mx)-ij{x){hix)-Ph)\ 



< \h\x)\ + 



+ 



-iPix) - —U\x) - V'(x))e"*^''(l - 
h 



1 



h!{t)P{t)dt 



^(2;) - - V'(x))e"'=^ P(x) ) / h'{t){l- P{t))dt 



< 



1 



\h!{x)\ + ei ( + Y^[r{x) - V'(^))e'^'=" (1 - P{x)) 
1 



P{t) dt 



+ei[-^{x) + y^{^\x)-ilj'{x))e''''^ 'P{x)] / (1-P(t))(it. 



Moreover we know, that the quantity in (|8.9p can be bounded by ^r-p, hence 



2bk {2k - 1) 



P{t)dt+ / . 



Now we bound 



/X poo poo 

P{t) dt+ (1 - P{t)) dt\ = \xP{x) - x{l - P{x)) + 2 I tp{t) dt\ < 2\x\ + 2K\Z\, 
-00 J X J X 

where Z is distributed according to p. Summarizing we have < C3 sup^, for some constant 

C3, using the fact that fh and therefore and are continuous. Hence fh satisfies Assumption 
(Bl). □ 

Sketch of the proof of Remark \4-3\ Now let p{x) = 6^ exp(— afcy(x)) and V satisfies the assumptions 
listed in Remark l4.3i To proof that fz (with respect to p) satisfies Assumption (B2), we adapt (|8.5p 
as well as (|8.6p and (|8.7p . using the assumptions on V. We obtain for x > 

V'{x) 



bk 



V"{x) +akV'{xy 



^ exp{-akV{x)) <1-P{a 
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and for X > 



and for X < 



1 - P(x) < minQ, — ^) exp(-a, F(x)) 



Estimating {—xp{x)fz{x)y gives 

(-^(x)/,(x)y< const. 

By our assumptions on V, the right hand side can be bounded for x > d with d G M+ and since 
ip{x)fz{x) is continuous, it is bounded everywhere. □ 

Acknowledgement. During the preparation of our manusscript we became aware of a preprint of S. 
Chatterjee ans Q.-M. Shao about Stein's method with applications to the Curie- Weiss model. As far 
as we understand, there the authors give an alternative proof of Theorem 1.2 and 1.3. 
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